Words containing ,:
Rank in Wordlist | Frequency | Word |
---|---|---|
3814 | 54 | 2,5 |
4245 | 48 | 1,5 |
8720 | 21 | 1,2 |
8727 | 21 | 3,5 |
9955 | 18 | 0,5 |
10428 | 17 | 4,5 |
10430 | 17 | 7,5 |
10946 | 16 | 1,4 |
12218 | 14 | 1,8 |
13002 | 13 | 1,6 |

Words containing %:
Rank in Wordlist | Frequency | Word |
---|---|---|
5107 | 39 | 40% |
5761 | 34 | 10% |
5926 | 33 | 50% |
6087 | 32 | 60% |
7066 | 27 | 20% |
7303 | 26 | 30% |
8732 | 21 | 70% |
8734 | 21 | 90% |
9095 | 20 | 80% |
10955 | 16 | 5% |

Words containing &:
Rank in Wordlist | Frequency | Word |
---|---|---|
11696 | 15 | R&B |
31291 | 4 | A&M |
49105 | 2 | A&R |
57806 | 2 | W&OD |
75667 | 1 | A&P |
75668 | 1 | A&Y |
75846 | 1 | AT&T |
84570 | 1 | D&Y |
89378 | 1 | GM&G |
90847 | 1 | H&M |

Words containing $:
Rank in Wordlist | Frequency | Word |
---|---|---|
57510 | 2 | US$97.9 |
111684 | 1 | U$S |
111741 | 1 | US$0.02 |
111742 | 1 | US$0.20 |
111743 | 1 | US$0.25 |
111744 | 1 | US$10.000 |
111745 | 1 | US$100,038,390 |
111746 | 1 | US$115.8 |
111747 | 1 | US$115.9 |
111748 | 1 | US$121.9 |

Words containing ":
Rank in Wordlist | Frequency | Word |
---|---|---|
888 | 221 | ." |

Words containing ':
Rank in Wordlist | Frequency | Word |
---|---|---|
51 | 2555 | d'un |
61 | 2060 | d'una |
84 | 1509 | que'l |
109 | 1196 | ye'l |
145 | 914 | como'l |
223 | 645 | d'esta |
243 | 609 | qu'el |
251 | 588 | d'esti |
307 | 505 | sobre'l |
342 | 477 | foi'l |

Words containing +:
Rank in Wordlist | Frequency | Word |
---|---|---|
49810 | 2 | B1257+12 |
55045 | 2 | O+221Y |
72084 | 1 | 1704+481 |
72528 | 1 | 19550+4152 |
72752 | 1 | 2+2 |
75439 | 1 | 9+1 |
75669 | 1 | A+B |
78706 | 1 | B0633+17 |
78739 | 1 | BD+10 |
78740 | 1 | BD+41 |

Words containing /:
Rank in Wordlist | Frequency | Word |
---|---|---|
2345 | 89 | km/s |
6208 | 32 | km/h |
6451 | 31 | y/o |
16866 | 10 | hab/km² |
17008 | 10 | m³/s |
20279 | 8 | km/s— |
23421 | 6 | 1/3 |
23535 | 6 | 3/4 |
25674 | 6 | http://web |
26819 | 5 | 2/3 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have very low frequency, there may be a problem in preprocessing.
Words containing % are selected with the following query:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
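
The same check can be repeated for every character in the list. The following is a minimal sketch, not the tool used to produce the tables above: it assumes a `words` table with the columns `w_id`, `word` and `freq` that appear in the query, and it uses Python's `sqlite3` module as a stand-in for whatever database the report was generated from. The function names and the `max_freq` threshold are hypothetical; the text only says that the frequency should be "very low".

```python
import sqlite3

# Special characters listed in the text above.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(char):
    """Build a LIKE pattern that matches any word containing `char`."""
    # '%' and '_' are LIKE wildcards and '\' is our escape character,
    # so these three must be escaped to be matched literally.
    if char in ("%", "_", "\\"):
        char = "\\" + char
    return "%" + char + "%"


def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words that contain `char`."""
    query = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(query, (like_pattern(char), limit)).fetchall()


def flag_frequent(rows, max_freq=5):
    """Keep rows whose frequency exceeds the (assumed) threshold."""
    return [row for row in rows if row[1] > max_freq]
```

Looping over `SPECIAL_CHARS` and passing each result to `flag_frequent` yields, per character, the words that may point to a preprocessing problem. Whether a flagged word is really an error still depends on the language, as the `'` table for this corpus shows.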